A transposon sequencing count data set has two types of transposon statistics. They are the insertion statistic and the site statistic. The former stands for the number of transposon insertions per gene and the latter stands for the number of transposon insertion sites per gene. These two statistics represent two types of transposon events in a genome. The insertion statistic represents the total number of transposon insertions that a gene has. This value depends on the sequencing coverage: the greater the sequencing coverage, the greater the number of insertions per gene. The site statistic represents the strength of the transposon distribution across a gene. This statistic depends on two factors, i.e. the sequencing coverage and the gene essentiality.

Table 2.12 shows a few lines of one replicate of the transposon sequencing data of the Francisella tularensis species [Yang, et al., 2017]. Note that these statistics are all gene-wise summary statistics. This means that both the insertions per gene and the sites per gene have been summarised from genome-wise data.
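The summarisation from genome-wise data to gene-wise statistics can be sketched as follows. This is only a minimal illustration, not the pipeline used to produce the data: the function name `gene_wise_summary`, the input layout (a mapping from genomic position to insertion count, plus inclusive gene coordinates) and the toy values are all assumptions made for the example.

```python
def gene_wise_summary(site_counts, genes):
    """Summarise genome-wise insertion data into gene-wise statistics.

    site_counts: dict mapping a genomic position to the number of
                 insertions observed at that position (assumed layout).
    genes:       dict mapping a gene ID to (start, end), inclusive
                 coordinates (assumed layout).
    """
    summary = {}
    for gene_id, (start, end) in genes.items():
        # Keep only positions inside the gene that carry at least one insertion.
        in_gene = [n for pos, n in site_counts.items()
                   if start <= pos <= end and n > 0]
        summary[gene_id] = {
            "counts": sum(in_gene),  # insertion statistic: total insertions
            "sites": len(in_gene),   # site statistic: distinct insertion sites
        }
    return summary


# Toy data, made up for illustration only.
site_counts = {12: 5, 40: 0, 57: 3, 130: 10, 145: 2, 300: 7}
genes = {"geneA": (1, 100), "geneB": (101, 200), "geneC": (201, 250)}

summary = gene_wise_summary(site_counts, genes)
for gene_id, stats in summary.items():
    print(gene_id, stats["counts"], stats["sites"])
```

In this toy example, position 40 has no insertions and therefore contributes to neither statistic, and position 300 lies outside every gene and is ignored.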

The first few lines of the first replicate of the Francisella tularensis species. ‘counts’ stands for the insertion statistic. ‘sites’ stands for the site statistic. ‘MF’ stands for the […]n feature statistic, which is a convolution between ‘counts’ and ‘sites’.

ID#Symbol          counts   sites   MF
FTT_0001#dnaA      402      7       1.446
FTT_0002#dnaN      280      18      2.913
FTT_0004c#isftu1   7490     159     1428.707
FTT_0005a#-        766      37      140.050
FTT_0006#-         15466    212     2695.370
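Rows of this kind can be read into a small record type as sketched below. The whitespace-separated layout, the `GeneRow` type and the `parse_row` helper are assumptions made for illustration; the actual file format of the data set may differ. The values are the five rows shown in the table.

```python
from typing import NamedTuple


class GeneRow(NamedTuple):
    gene_id: str   # locus tag, e.g. "FTT_0001"
    symbol: str    # gene symbol, "-" when none is given
    counts: int    # insertion statistic
    sites: int     # site statistic
    mf: float      # MF statistic, kept as reported


def parse_row(line):
    """Parse one whitespace-separated row (assumed layout)."""
    key, counts, sites, mf = line.split()
    # The first field joins the locus tag and the gene symbol with '#'.
    gene_id, symbol = key.split("#", 1)
    return GeneRow(gene_id, symbol, int(counts), int(sites), float(mf))


lines = [
    "FTT_0001#dnaA 402 7 1.446",
    "FTT_0002#dnaN 280 18 2.913",
    "FTT_0004c#isftu1 7490 159 1428.707",
    "FTT_0005a#- 766 37 140.050",
    "FTT_0006#- 15466 212 2695.370",
]
rows = [parse_row(line) for line in lines]

for row in rows:
    print(row.gene_id, row.symbol, row.counts, row.sites, row.mf)
```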

The transposon insertion spectrum of the gene dnaA (FTT_0001) in the first replicate of the Francisella tularensis species.